N=1:

rank | frequency | n-gram |
---|---|---|
1 | 62925 | -n |
2 | 45108 | -e |
3 | 27419 | -s |
4 | 24197 | -r |
5 | 24171 | -t |

N=2:

rank | frequency | n-gram |
---|---|---|
1 | 47520 | -en |
2 | 20122 | -er |
3 | 10020 | -ng |
4 | 9711 | -te |
5 | 7784 | -es |

N=3:

rank | frequency | n-gram |
---|---|---|
1 | 11283 | -ten |
2 | 8231 | -ung |
3 | 7170 | -gen |
4 | 4521 | -hen |
5 | 4226 | -ter |

N=4:

rank | frequency | n-gram |
---|---|---|
1 | 3774 | -chen |
2 | 3340 | -ngen |
3 | 2159 | -nden |
4 | 2014 | -tion |
5 | 1718 | -rung |

N=5:

rank | frequency | n-gram |
---|---|---|
1 | 2670 | -ungen |
2 | 1579 | -schen |
3 | 1346 | -chaft |
4 | 1315 | -enden |
5 | 1222 | -ation |
The tables show the most frequent letter n-grams at the end of words for N=1…5. The procedure parallels 2.2.5 Most frequent word beginnings; the aim here is suffix detection rather than prefix detection.
For N=3:
SELECT @pos := (@pos + 1) AS pos, xx.*
FROM (SELECT @pos := 0) r,
     (SELECT COUNT(*) AS cnt, CONCAT('-', RIGHT(word, 3)) AS ngram
      FROM words
      WHERE w_id > 100
      GROUP BY RIGHT(word, 3)
      ORDER BY cnt DESC) xx
LIMIT 5;
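The same counting can be sketched outside the database. The following is a minimal Python illustration, assuming the word list is available as a plain list of strings. The function name `top_endings` and the sample words are hypothetical and are not part of the corpus tooling. Like the query, it counts each word-list entry once and prefixes the ending with a hyphen.

```python
from collections import Counter


def top_endings(words, n, limit=5):
    """Return the `limit` most frequent n-letter endings as (rank, count, ending) rows."""
    # Each entry of the word list is counted once, as COUNT(*) does per row.
    # A word shorter than n contributes its whole form, like RIGHT(word, n) in MySQL.
    counts = Counter(word[-n:] for word in words if word)
    ranked = counts.most_common(limit)
    return [(rank, cnt, "-" + ending)
            for rank, (ending, cnt) in enumerate(ranked, start=1)]


# Hypothetical sample words, not taken from the corpus.
sample = ["machen", "lachen", "Zeitungen", "Wohnungen", "Nation", "Station", "kochen"]

for row in top_endings(sample, 4):
    print(row)
```

Changing the second argument from 1 to 5 yields the rows for the other values of N. Unlike a database with a case-insensitive collation, this sketch treats upper- and lower-case letters as distinct.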
2.2.5 Most frequent word beginnings